
PROTEIN ACTIVITY SCREENING 
OF CLONES HAVING DNA FROM UNCULTIVATED MICROORGANISMS 
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3 This invention relates to the field of preparing and screening libraries of clones 

h containing microbially derived DNA. 

' U Naturally occurring assemblages of microorganisms often encompass a bewildering array 

^ of physiological and metabolic diversity. In fact, it has been estimated that to date less than one 
i percent of the worid's organisms have been cultured. It has been suggested that a large fi^ction 
=^ of this diversity thus far has been unrecognized due to difficulties in enriching and isolating 
i microorganisms in pure culture. Therefore, it has been difficult or impossible to identify or 
^ isolate valuable proteins, e.g. enzymes, from these samples. These limitations suggest the need 
for altemative approaches to characterize the physiological and metabolic potential, i e. activities 
of interest of as-yet uncultivated microorganisms, which to date have been characterized solely 
by analyses of PCR amplified rRNA gene firagments, clonally recovered firom mixed assemblage 
nucleic acids. 

In one aspect, the invention provides a process of screening clones having DNA firom an 
uncultivated microorganism for a specified protein, e.g. enzyme, activity which process 
comprises: 

screening for a specified protein, e.g enzyme, activity in a library of clones prepared by 
(i) recovering DNA fi-om a DNA population derived firom at least one uncultivated 



microorganism; and 

(ii) transforming a host with recovered DNA to produce a library of clones which are 
screened for the specified protein, e.g. enzyme, activity. 

The library is produced from DNA which is recovered without culturing of an organism, 
particularly where the DNA is recovered from an environmental sample containing 
microorganisms which are not or cannot be cultured. 

In a preferred embodiment DNA is ligated into a vector, particularly wherein the vector 
further comprises expression regulatory sequences which can control and regulate the production 
of a detectable proteins, e.g. enzyme, activity from the ligated DNA. 

V^^ The f-factor (or fertility factor) in E. a plasmid which effects high frequency 

\ansfer of itself during conjugation and less frequent transfer of the bacterial chromosome itself 
To achieve and stably propogate large DNA fragments from mixed microbial samples, a 
particularly preferred embodiment is to use a cloning vector containing an f-factor origin of 
replication to generate genomic libraries that can be replicated with a high degree of fidelity. 
When integrated with DNA from a mixed uncultured environmental sample, this makes it 
possible to achieve large genomic fragments in the form of a stable "enviromnental DNA 
Hbrary." 

In another preferred embodiment, double stranded DNA obtained from the uncultivated 

DNA population is selected by: 

converting the double stranded genomic DNA into single stranded DNA; 

recovering from the converted single stranded DNA single stranded DNA which 
specifically binds, such as by hybridization, to a probe DNA sequence; and 

converting recovered single stranded DNA to double stranded DNA. 

The probe may be directly or indirectly bound to a soUd phase by which it is separated 
from single stranded DNA which is not hybridized or otherwise specifically bound to the probe. 



# 



The process can also include releasing single stranded DNA from said probe after 
recovering said hybridized or otherwise bound single stranded DNA and amplifying the single 
stranded DNA so released prior to converting it to double stranded DNA. 

s/i Vh-VJ/* The invention alsd^provides a process of screening clones having DNA from an 

uncultivated microorganisms for a specified protein, e.g ei^fee, activity which compnses 
screening for a specified gehe cluster protein product activity in the library of clones prepared 
by: (i) recovering DNA froA a DNA population derived from at least one uncultivated 
'O^C^microorganism; and (ii) traiLforming a host with recovered DNA to produce a library of clones 
with the screens for the specified protein, e.g enzyme, activity. The library is produced from 
gene cluster DNA which is riovered without culturing of an organism, particularly where the 
DNA gene clusters are recovAed from an environmental sample containing microorganisms 
which are not or cannot be cultured. 
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Alternatively, double-stranded gene cluster DNA obtained from the uncultivated DNA 
population is selected by converting the double-stranded genomic gene cluster DNA into single- 
;i stranded DNA; recovering from the converted single-sfranded gene cluster polycistron DNA, 
^ single-stranded DNA which specifically binds, such as by 

1 hybridization, to a polynucleotide probe sequence; and converting recovered singlestranded gene 
20 cluster DNA to double-stranded DNA. 

These and other aspects of the present invention are described with respect to particular 
preferred embodiments and will be apparent to those skilled in the art from the teachings herein. 
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The microorganisms fromYhich the libraries may be prepared include prokaryotic 
'fmcroorganisms, such as EubacteriV and Archacbacteria, and lower eukaryotic microorganisms 
such as fungi, some algae and protokoa. The microorganisms are uncultured microorganisms 
obtained from environmental sampled and such microorganisms may be extremophiles, such as 
thermophiles, hyperthermophiles, psydJirophiles, psychrotrophs, etc. 
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As indicated above, the library is produced from DNA which is recovered without 
culturing of an organism, particularly where the DNA is recovered from an environmental 
sample containing microorganisms which are not or cannot be cultured. Sources of 
microorganism DNA as a starting material library from which DNA is obtained are particularly 
contemplated to include environmental samples, such as microbial samples obtained from Arctic 
and Antarctic ice, water or permafrost sources, materials of volcanic origin, materials from soil 
or plant sources in tropical areas, etc. Thus, for example, genomic DNA may be recovered from 
either uncultured or non-culturable organism and employed to produce an appropriate library of 
clones for subsequent determination of protein, e.g enzyme, activity. 

Bacteria and many eukaryotes have a coordinated mechanism for regulating genes whose 
products are involved in related processes. The genes are clustered, in structures referred to as 
"gene clusters," on a single chromosome and are transcribed together under the control of a 
single regulatory sequence, including a single promoter which initiates transcription of the entire 
cluster. The gene cluster, the promoter, and additional sequences that function in regulation 
altogether are referred to as an "operon" and can include up to 20 or more genes, usually from 2 
to 6 genes. Thus, a gene cluster is a group of adjacent genes that are either identical or related, 
usually as to their function. 

Some gene families consist of identical members. Clustering is a prerequisite for 
maintaining identity between genes, although clustered genes are not necessarily identical. Gene 
clusters range from extremes where a duplication is generated to adjacent related genes to cases 
where hundreds of identical genes lie in a tandem array. Sometimes no significance is 
discemable in a repetition of a particular gene. A principal example of this is the expressed 
duplicate insulin genes in some species, whereas a single insulin gene is adequate in other 
mammalian species. 

It is important to fiirther research gene clusters and the extent to which the full length of 
the cluster is necessary for the expression of the proteins resulting therefrom. Further, gene 
clusters undergo continual reorganization and, thus, the ability to create heterogeneous libraries 



of gene clusters from, for example, bacterial or other prokaryote sources is valuable in 
determining sources of novel proteins, particularly including proteins, e.g. enzymes, such as, for 
example, the polyketide syntheses that are responsible for the synthesis of polyketides having a 
vast array of usefol activities. Other types of proteins that are the product(s) of gene clusters are 
also contemplated, including, for example, antibiotics, antivirals, antitumor agents and 
regulatory proteins, such as insulin. 



Polyketides are molUles which are an extremely rich source of bioactivities, including 
antibiotics (such as tetracyclLs and erythromycin), anti-cancer agents (daunomycin), 
immunosuppressants (FKSOsVnd rapamycin), and veterinary products (monensin). Many 
polyketides (produced by polyLide syntheses) are valuable as therapeutic agents. Polyketide 
syntheses are multifunctional pLins, e.g. enzymes, that catalyze the biosynthesis of a hugh 
variety of carbon chains differing in length and patterns of functionality and cyclization. 
Polyketide synthase genes fall irX gene clusters and at least one type (designated type I) of 
polyketide syntheses have large siL genes and proteins, e.g. enzymes, complicating genetic 
manipulation and in v"ro studies oAthese genes/proteins. 
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The ability to select and combine desired components from a Ubrary of polyketides and 
postpolyketide biosynthesis genes for generation of novel polyketides for study is appealing. The 
method(s) of the present invention make it possible to and facilitate the cloning of novel 
polyketide syntheses, since one can generate gene banks with clones containing large inserts 
(especially when using the f-factor based vectors), which facilitates cloning of gene clusters. 



/ Preferably, the gene cluLr DNA is ligated into a vector, particularly wherein a vector 
fiirther comprises expression reg^atoiy sequences which can control and regulate the production 
of a detectable protein or protein-Uted array activity from the Ugated gene clusters. Use of 
.vectors which have an exceptionall\ large capacity for exogenous DNA introduction are 
Iparticularly appropriate for use withW gene clusters and are described by waj. of example 
herein to include the f-factor (or fertilL factor) of E. coU. This f-factor of E. oeidis a plasmid 
y which affect highfrequency transfer of Llf during conjugation and is ideal to achieve and stably 
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propagate large DNA fragments, such as gene clustersjfeir. mixed microbial samples. 



Tlie term "derived" or "isolated" means that material is removed from its original 
environment (e.g., the natural environment if it is naturally occurring). For example, a naturally- 
occurring polynucleotide or polypeptide present in a living animal is not isolated, but the same 
polynucleotide or polypeptide separated from some or all of the coexisting materials in the 
natural system, is isolated. 

The DNA isolated or derived from these microorganisms can preferably be inserted into a 
vector prior to probing for selected DNA. Such vectors are preferably those containing 
expression regulatory sequences, including promoters, enhancers and the like. Such 
polynucleotides can be part of a vector and/or a composition and still be isolated, in that such 
vector or composition is not part of its natural enviromnent. Particularly preferred phage or 
plasmid and methods for introduction and packaging into them are described in detail in the 
protocol set forth herein. 



^T^J^The following oVtlines a general procedure for producing libraries fi-om nonculturable 
" orgasms, which libraril can be probed to select therefrom DNA sequences which hybridize to 

specified probe DNA: 
20 0'^ Obtain Biomass 

DNA Isolation 

Shear DNA (25 gauge nWdle) 
Blunt DNA (Mung Bean Nuclease) 
Methylate (EcoR I Methylake) 
25 Ligate to EcoR I linkers (GGAATTCC) 

Cut back linkers (EcoR I Restiiction Endonuclease) 
Size Fractionate (Sucrose Gradient) 

Ligate to lambda vector (LambdkzAP® (Stratagene) and gtl 1) 
A I Package (in vjtro lambda packagirig extract) 
30 / Plate on E.'^^ host and amplify \ 



The probe DNA used for selectively recovering DNA of interest from the DNA derived 
from the at least one uncultured microorganism can be a full-length coding region sequence or a 
partial coding region sequence of DNA for an protein, e.g. enzyme, of known activity, a 
phylogenetic marker or other identified DNA sequence. The original DNA library can be 

5 preferably probed using mixtures of probes comprising at least a portion of the DNA sequence 
encoding the specified activity. These probes or probe libraries are preferably single-stranded 
and the microbial DNA which is probed has preferably been converted into single-stranded form. 
The probes that are particularly suitable are those derived fi-om DNA encoding proteins, e.g. 
enzymes, having an activity similar or identical to the specified protein, e.g enzyme, activity 

10 which is to be screened. 

!^ The probe DNA should be at least about 10 bases and preferably at least 15 bases. \n one 

if; embodiment, the entire coding region may be employed as a probe. Conditions for the 

hybridization in which DNA is selectively isolated by the use of at least one DNA probe will be 
15 S designed to provide a hybridization stringency of at least about 50% sequence identity, more 
particularly a stringency providing for a sequence identity of at least about 70%. 

li Hybridization techniques for probing a microbial DNA library to isolate DNA of 

:g potential interest are well known in the art and any of those which are described in the literature 
suitable for use herein, particularly those which use a solid phasebound, directly or indirectly 
bound, probe DNA for ease in separation from the remainder of the DNA derived from the 



20 are 



25 



microorganisms. 

Preferably the probe DNA is "labeled" with one partner of a specific binding pair (i.e. a 
ligand) and the other partner of the pair is bound to a solid matrix to provide ease of separation 
of target from its source. The ligand and specific binding partner can be selected from, in either 
orientation, the following: (1 ) an antigen or hapten and an antibody or specific binding fragment 
thereof; (2) biotin or iminobiotin and avidin or streptavidin; (3) a sugar and a lectin specific 
therefor; (4) a protein, e.g. enzyme, and an inhibitor therefor; (5) an apoenzyme and cofactor; (6) 
30 complementary homopolymeric oligonucleotides; and (7) a hormone and a receptor therefor. The 

7 
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solid phase is preferably selected from: (1) a glass or polymeric surface; (2) a packed column of 
polymeric beads: and (3) magnetic or paramagnetic particles. 

The library of clones prepared as described above can be screened directly for enzymatic 
activity without the need for culture expansion, amplification or other supplementary procedures. 
However, in one preferred embodiment, it is considered desirable to amplify the DNA recovered 
from the individual clones such as by 
PCR. 

Further, it is optional but desirable to perform an amplification of the target DNA that has 
been isolated. In this embodiment the selectively isolated DNA is separated fi-om the probe DNA 
after isolation. It is then amplified before being used to transform hosts. The double stranded 
DNA selected to include as at least a portion thereof a predetermined DNA sequence can be 
rendered single stranded, subjected to amplification and reannealed to provide amplified 
numbers of selected double stranded DNA. Numerous amplification methodologies are now well 
known in the art. 

The selected DNA is then used for preparing a library for screening by transforming a 
suitable organism. Hosts, particularly those specifically identified herein as preferred, are 
transformed by artificial introduction of the vectors containing the target DNA by inoculation 
under conditions conducive for such transformation. 



The resultant libraries of transformed clones are then screened for clones which display 
activity for the protein, e.g. enzyme, of interest in a phenotypic assay for protein, e.g. enzyme, 
25 activity. 

Having prepared a multiplicity of clones from DNA selectively isolated fi-om an 
organism, such clones are screened for a specific protein, e.g. enzyme, activity and to identify 
the clones having the specified protein, e.g enzyme, characteristics. 

30 
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jy>^ The screening for pVotein, e.g. enzyme, activity may be effected on individual expression 

clones or may be initially effected on a mixture of expression clones to ascertain whether or not 
the mixture has one or mores^ecified protein, e.g. enzyme, activities. If the mixture has a 
specified protein, e.g. S^^lfeVtivity, then the individual clones may be rescreened for such 
protein, e.g. enzyme, activity or for a more specific activity. Thus, for example, if a clone 
mixture has hydrolase activity, th\n the individual clones may be recovered and screened to 
determine which of such clones hakhydrolase activity. 

The DNA derived from a microorganism(s) is preferably inserted into an appropriate 
vector (generally a vector containing suitable regulatory sequences for effecting expression) 
prior to subjecting such DNA to a selection procedure to select and isolate therefrom DNA 
which hybridizes to DNA derived from DNA encoding an proteins, e.g. enzyme(s), having the 
specified protein, e.g. enzyme, activity. 
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As representative examples of expression vectors which may be used there may be 
mentioned viral particles, baculovirus, phage, plasmids, phagemids. cosmids, phosmids, bacterial 
artificial chromosomes, viral DNA (e.g. vaccinia, adenovirus, foul pox virus, pseudorabies and 
derivatives of SV40), Pl-based artificial chromosomes, yeast plasmids, yeast artificial 
chromosomes, and any other vectors specific for specific hosts of interest (such as bacillus, 
aspergiUus, yeast, etc.) Thus, for example, the DNA may be included in any one of a variety of 
expression vectors for expressing a polypeptide. Such vectors include chromosomal, 
nonchromosomal and synthetic DNA sequences. Large numbers of suitable vectors are known to 
those of skill in the art, and are commercially available. The following vectors are provided by 
way of example; Bacterial: pQE70, pQE60, pQE-9 (Qiagen), pBLUESCRIPT SK, 
pBLUESCRIPT KS (Stratagene); pTRC99a, pKK223-3, pDR540, pRIT2T (Pharmacia); 
Eukaryotic: pWLNEO, pXTI, pSG5 (Stratagene) pSVK3, pBPV, pMSG. pSVLSV40 
(Pharmacia). However, any other plasmid or vector may be used as long as they are replicable 
and viable in the host. 
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^ y A particularly preferre\d type of vector for use in the present invention contains an f- 

fector origin of replication. A-factor (or fertility factor) in E. coli is a plasmid which effects 
high frequency transfer of itself Lng conjugation and less frequent transfer of the bacterial 
chromosome itself. A particularly Referred embodiment is to use cloning vectors, referred to as 
"fosmids" or bacterial artificial chroVosome (BAC) vectors. These are derived from the E. cold 
f-factor and are able to stably integratVarge segments of genomic DNA. When integrated with 
DNA from a mixed uncultured envirolental sample, this makes it possible to achieve large 
genomic fragments in the form of a stable? "enviromnental DNA library." 
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The DNA derived from a microorganism(s) may be inserted into the vector by a variety 
of procedures. In general, the DNA sequence is inserted into an appropriate restriction 
endonuclease site(s) by procedures known in the art. Such procedures and others are deemed to 
be within the scope of those skilled in the ait. 

The DNA sequence in the expression vector is operatively linked to an appropriate 
expression control sequence(s) (promoter) to direct mBNA synthesis. Particular named baCenal 
promoters include lacl. lacZ, T3. T7, apt, lambda PR, PL and trp. Eukaryotic promotets tnclude 
CMV immediate early, HSV thymidine kinase, early and late SV40. LTte ftom ret^virus, and 
mouse metallotnionein-1. Selection of the appropriate vector and promoter is well wtthm the 
level of ordinary skill in the art. The expression vector also contains a ribosome btndtng stte for 
translation initiation and a transcription temtinator. The vector may also include appropnate 
sequences for amplifying expression. Promoter regions can be selected from any desned gene 



using 



CAT (chloramphenicol transferase) vectors or other vectors with selectable markers. 



m addition, the expression vectors preferably contain one or more selectable market 
genes to provide a phenotypic trait for selection of transfomted host cells such as dihydrofolate 
reductase or neomycin resistance for eukaryotic cell culture, or such as tetracycline or amp.allm 

resistance in E. coli. 
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Generally, recontbinant expression vectors will include origins of replication and 
lectaj)^e markers permitting transformation of the host cell, e.g, the ampicillin resistance gene 
of E. eol3^and S. cerevisiae TRPl gene, and a promoter derived from a highly-expressed gene to 
direct transcription of a downstream structural sequence. Such promoters can be derived from 
operons encoding glycolytic pro\eins, e.g enzymes, such as 3-phosphoglycerate kinase (PGK), a- 
factor, acid phosphatase, or heat siaock proteins, among others. The heterologous structural 
sequence is assembled in appropriate phase with translation initiation and termination sequences, 
and preferably, a leader sequence capable of directing secretion of translated protein into the 
periplasmic space or extracellular medium. 

The DNA selected and isolated as hereinabove described is introduced into a suitable 
host to prepare a library which is screened for the desired protein, e.g. enzyme, activity. The 
selected DNA is preferably already in a vector which includes appropriate control sequences 
whereby selected DNA which encodes for an protein, e.g enzyme, may be expressed, for 
detection of the desired activity. The host cell can be a higher eukaryotic cell, such as a 
mammalian cell, or a lower eukaryotic cell, such as a yeast cell, or the host cell can be a 
prokaryotic cell, such as a bacterial cell. Introduction of the construct into the host cell can be 
effected by transformation, calcium phosphate transfection, DEAE-Dextran mediated 
transfection, DMSO or electroporation (Davis, L., Dibner, M., Battey, I., Basic Methods in 
Molecular Biology, (1986)). 

As representative examples of appropriate hosts, there may be mentioned: bacterial cells, 
such as E. coli. Bacillus, Streptomyces, Salmonella typhimurium; fungal cells, such as yeast; 
insect cells such as Drosophila S2 and Spodoptera SJ9; animal cells such as CHO, COS or 
Bowes melanoma; adenoviruses; plant cells, etc. 

The selection of an appropriate host is deemed to be within the scope of those skilled in the art 
from the teachings herein. 
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Host cells are genetically engineered (transduced or transformed or transfected) the 
vectors. The engineered host cells can be cultured in conventional nutrient media modified as 
appropriate for activating promoters, selecting transfonnants or amplifying genes. The cukure 
conditions, such as temperature, pH and the like, are those previously used with the host cell 
selected for expression, and will be apparent to the ordinarily skilled artisan. 

The library may be screened for a specified protein, e.g. enzyme, activity by procedures 
known in the art. For example, the protein, e.g enzyme, activity may be screened for one or more 
of the six lUB classes; oxidoreductases, transferases, hydrolases, lyases, isomerases and iigases. 
The recombinant proteins, e.g. enzymes, which are determined to be positive for one or more of 
the lUB classes may then be rescreened for a more specific protein, e.g. enzyme, activity. 

Alternatively, the library may be screened for a more specialized protein, e.g enzyme, 
activity. For example, instead of generically screening for hydrolase activity, the library may be 
screened for a more specialized activity, i.e. the type of bond on which the hydrolase acts. Thus, 
for example, the library may be screened to ascertain those hydrolases which act on one or more 
specified chemical functionalities, such as: (a) amide (peptide bonds), i.e. proteases; (b) ester 
bonds, i.e. esterases and lipases; (c) acetals, i.e, glycosidases etc. 

The clones which are identified as having the specified protein, e g. enzyme, activity may 
then be sequenced to identify the DNA sequence encoding an protein, e.g. enzyme, having the 
specified activity. Thus, in accordance with the present invention it is possible to isolate and 
identify: (i) DNA encoding an protein, e.g enzyme, having a specified protein, e.g. enzyme, 
activity, (ii) proteins, e.g. enzymes, having such activity (inlcuding the amino acid sequence 
thereof) and (iii) produce recombinant proteins, e.g. enzymes, having such activity. 

The present invention may be employed for example, to identify uncultured 
microorganisms with proteins, e.g. enzymes, having, for example, the following activities which 
may be employed for the following uses: 
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Lipase/Esterase 

a. Enantioselective hydrolysis of esters (lipids)/ thioesters 

1 ) Resolution of racemic mixtures 

2) Synthesis of optically active acids or alcohols from mesodiesters 

b. Selective syntheses 

1 ) Regiospecific hydrolysis of carbohydrate esters 

2) Selective hydrolysis of cyclic secondary alcohols 

c. Synthesis of optically active esters, lactones, acids, alcohols 

1 ) Transesterification of activated/nonactivated esters 

2) Interesterification 

3) Optically active lactones from hydroxyesters 

4) Regio- and enantioselective ring opening of anhydrides 

d. Detergents 

e. Fat/Oil conversion 

f. Cheese ripening 

Protease 

a. Ester/amide synthesis 

b. Peptide synthesis 

c. Resolution of racemic mixtures of amino acid esters 

d. Synthesis of non-natural amino acids 

e. Detergents/protein hydrolysis 

Glycosidase/Glycosyl transferase 

a. Sugar/polymer synthesis 

b. Cleavage of glycosidic linkages to form mono, all-and oligosaccharides 

c. Synthesis of complex oligosaccharides 

d. Glycoside synthesis using UDP-galactosyl transferase 

e. Transglycosylation of disaccharides, glycosyl fluorides, aryl galactosides 
f Glycosyl transfer in oligosaccharide synthesis 
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1. 



Diastereoselective cleavage of p-glucosylsulfoxides 
Asymmetric glycosylations 
Food processing 
Paper processing 



Phosphatase/Kinase 



a. 



Synthesis/hydrolysis of phosphate esters 
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1 ) Regio-, enantioselective phosphorylation 

2) hitroduction of phosphate esters 

3) Synthesize phospholipid precursors 

4) Controlled polynucleotide synthesis 

b. Activate biological molecule 

c. Selective phosphate bond formation v^thout protecting groups 
Mono/Dioxygenase 

a. Direct oxyfunctionalization of unactivated organic substrates 

b. Hydroxylation of alkane, aromatics, steroids 

c. Epoxidation of alkenes 

d. Enantioselective sulphoxidation 

e. Regio- and stereoselective Bayer-Villiger oxidation 

Haloperoxidase 

a. Oxidative addition of halide ion to nucleophilic sites 

b. Addition of hypohalous acids to olefmic bonds 

c. Ring cleavage of cyclopropanes 

d. Activated aromatic substrates converted to ortho and para derivatives 

e. 1 .3 diketones converted to 2-halo-derivatives 

f. Heteroatom oxidation of sulfur and nitrogen containing substrates 
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g. Oxidation of enol acetates, alkynes and activated aromatic rings 



Lignin peroxidase/Diarylpropane peroxidase 

a. Oxidative cleavage of C-C bonds 

b. Oxidation of benzylic alcohols to aldehydes 
c Hydro xylation of benzylic carbons 

d. Phenol dimerization 

e. Hydroxylation of double bonds to form diols 
f Cleavage of lignin aldehydes 

Epoxide hydrolase 

a. Synthesis of enantiomerically pure bioactive compounds 

b. Regio- and enantioselective hydrolysis of epoxide Aromatic and olefmic 
epaxidation by monoaxygenases to form epoxides 

d. Resolution of racemic epoxides 

e. Hydrolysis of steroid epoxides 

Nitrile hydratase/nitriluse 

a. Hydrolysis of aliphatic nitriles to carboxamides 

b. Hydrolysis of aromatic, heterocyclic, unsaturated aliphatic nitriles to 
corresponding acids 

c. Hydrolysis of acrylonitrile 

d. Production of aromatic and carboxamides, carboxylic acids 
(nicotinamide, picolinamide, isonicotinamide) 

e. Regioselective hydrolysis of acrylic dinitrile 
f a-amino acids from a-hydroxynitriles 

Transaminase 

a. Transfer of amino groups into oxo-acids 
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Amidase/Acylase 

a. Hydrolysis of amides, amidines, and other C-N bonds 

b. Non-natural amino acid resolution and synthesis 



16 



Preparation of a Representative DNA Library 



The following outlines the procedures used to generate a gene library from a sample of 
the exterior surface of a whale bone found at 1240 meters depth in the Santa Catalina Basin 
during a dive expedition. 



Isolate DNA. 

ISOQUICK Procedure as per manufacturer's instructions. 



Shear DNA 



1 . Vigorously push and pull DNA through a 25G double-hub 
needle and 1-cc syringes about 500 times. 

2. Check a small amount (0.5 ^g) on a 0.8% agarose gel to make 

sure the majority of the DNA is in the desired size range (about 3-6 kb). 



Blunt DNA 

1. Add: 

HjO to a final volume of 405 ^il 

45 ^l 1 ox Mung Bean Buffer 

2.0|il Mung Bean Nuclease (150 |i/|J.l) 

2. Incubate 37°C, 15 minutes. 

3 . Phenol/chloroform extract once. 

4. Chloroform extract once. 

5. Add 1 ml ice cold ethanol to precipitate. 

6. Place on ice for 10 minutes. 

7. Spin in micro fuge, high speed, 30 minutes. 

8. Wash v^ith 1 ml 70% ethanol, 

9. Spin in microfuge, high speed, 1 0 minutes and dry. 
to a final volume of 4.0 |il 

17 



Methylate DNA 



Gently resuspend DNA in 26 ^1 TE. 



2. 



Add: 



4.0 |il 1 OX EcoR I Methylase Buffer 

0.5 lal SAM(32mM) 

5.0 |il EcoR I Methylase (40 ^/[il) 



3. 



Incubate 37°, 1 hour 



Insure Blunt Ends 



I. 



Add to the methylation reaction: 



5.0 |il lOOmMMgClj 

8.0^1 dNTP mix (2.5 mM of each dGTP, dATP, dTTP, dCTP) 
4.0 III Klenow(5 n//il) 

2. hicubate 12°C, 30 minutes. 

3. Add450|allXSTE. 

4. Phenol/chloroform extract once. 

5. Chloroform extract once. 

6. Add 1 ml ice cold ethanol to precipitate and place on ice for 10 minutes. 

7. Spin in microfiige, high speed, 30 minutes. 

8. Wash with 1 ml 70% ethanol. 

9. Spin in microfiige, high speed, 10 minutes and dry. 

Linker Ligation 

1 . Gently resuspend DNA in 7 |al Tris-EDTA (TE). 

2. Add: 



14|al 
3.0 \i\ 
3.0 ^il 
3.0 ^l 



T4 DNA Ligase (4Wu/nl) 



Phosohorvlated EcoR I linkers (200 ngy^l) 
lOX Litigation Buffer 



lOmMrATP 



3. 



Incubate 4°C, overnight. 
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EcoRl Cutback 

1 . Heat kill ligation reaction 68°C, 10 minutes. 

2. Add: 

237.9 H2O 

30^1 lOX £co7? I Buffer 

2.1 )il EcoR I Restriction Enzyme ( 100 u/jil) 

3. Incubate 3 7°C, 1.5 hours. 

4. Add 1.5 ^10.5 MEDTA. 

5. Place on ice. 

Sucrose Gradient (2.2 ml) Size Fractionation 

1 . Heat sample to 65°C, 10 minutes. 

2. Gently load on 2.2 ml sucrose gradient. 

3. Spin in mini-ultracentriflige, 45K, 20°C, 4 hours (no brake). 

4. Collect fractions by puncturing the bottom of the gradient tube with a 20G needle 
and allowing the sucrose to flow through the needle. Collect the first 20 drops in a 
Falcon 2059 tube then collect 10 1-drop fractions (labelled 1-10). Each drop is 
about 60 |il in volimie. 

5. Run 5 1^1 of each fraction on a 0.8% agarose gel to check the 
size. 

6. Pool fractions 1-4 (-10-L5 kb) and, in a separate tube, pool 
fractions 5-7 (about 5-0.5 kb). 

7. Add 1 ml ice cold ethanol to precipitate and place on ice for 10 minutes. 

8. Spin in microflige, high speed, 30 minutes. 

9. Wash with 1 ml 70% ethanol. 

10. Spin in microfiige, high speed, 10 minutes and dry. 

1 1 . Resuspend each in 1 0 |al TE buffer. 
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Test Ligation to Lambda Arms 

1 . Plate assay to get an approximate concentration. Spot 0.5 pi of the sample on 
agarose containing ethidium bromide along with standards (DN A samples of 
known concentration). View in UV light and estimate concentration compared to 
the standards. 

Fraction 1-4 = > 1.0 ^g/|il 1. Fraction 5-7 = 500 ng/|iL 

2. Prepare the following ligation reactions (5 pi reactions) and 
incubate 4°C, overnight: 



Sample 


H2O 


lOX 


lOmM 


Lambda 


Insert 


T4DNA 






Ligase 


rATP 


arms 


DNA 


Ligase (4 






Buffer 




(gtl 1 and 














ZAP) 






Fraction 1-4 


0.5 [il 


0.5 ^il 


0.5 ^il 


1.0 ^1 


2.0 [il 


0.5 111 


Fraction 5-7 


0.5 III 


0.5 ^il 


0.5 |il 


1.0 [il 


2.0 ^il 


-0.5 ^il 



Test Package and Plate 

1 . Package the ligation reactions following manufacturer's protocol. Package 2.5 ^il per 
packaging extract (2 extracts per ligation). 

2. Stop packaging reactions with 500 |il SM buffer and pool packaging that came from the 
same ligation. 

3. Titer 1 ,0 ^1 of each on appropriate host (0D6W == 1 .0) [XLIBlue MRF for ZAP® 
(Stratagene) and Y1088 for gtl 1] 

Add 200 |il host (in mM MgS04) to Falcon 2059 tubes 
Inoculate with 1 |il packaged phage 
hicubate 37°C, 15 minutes 
Add about 3 ml 48°C top agar 

[50 ml stock containing 150 |il IPTG (0.5M) and 300 ^l X-GAL (350 mg/ml)] 
Plate on 100mm plates and incubate 37°C, overnight. 
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4. Efficiency results: 

gtll: 1 .7 X 10'* recombinants with 95% 

background 

ZAP ® (Stratagene): 4.2 x 10^ recombinants with 66% 

5 background 

Contaminants in the DNA sample may have inhibited the enzymatic reactions, though the 
sucrose gradient and organic extractions may have removed them. Since the DNA sample 
was precious, an effort was made to "fix" the ends for cloning: 

10 Re-Blunt DNA 

0 1 . Pool all left over DNA that was not ligated to the lambda arms (Fractions 1 -7) and 
If n add H2O to a final volume of 12 |il . Then add: 

\^ 143 |il H2O 

15 m 20 |il lOX Buffer 2 (fi-om Stratagene's cDNA Synthesis Kit) 

23 ^il Blunting dNTP (from Stratagene's cDNA Synthesis Kit) 
\^ 2.0 |il Pfti (fi-om Stratagene"s cDNA Synthesis 

\1 2. Incubate 72°C, 30 minutes. 

1 3 . Phenol/chloroform extract once. 
20 0 4. Chloroform extract once. 

5. Add 20 |iL 3M NaOAc and 400 |al ice cold ethanol to precipitate. 

6. Place at -20°C, overnight. 

7. Spin in micro fiige, high speed, 30 minutes. 

8. Wash with 1 ml 70% ethanol. 

25 9. Spin in microfuge, high speed, 10 minutes and dry. 

(Do NOT Methylate DNA since it was already methylated in the first round of processing) 
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Adaptor Ligation 

1 . Gently resuspend DNA in 8 111 EcoR I adaptors (from Stratagene's cDNA 
Synthesis Kit). 

2. Add: 



3. Incubate 4°C, 2 days. 
NOT cutback since using ADAPTORS this time. Instead, need to phosphorylate) 

Phosphorylate Adaptors 

1 . Heat kill ligation reaction 70°C, 30 minutes. 
Add: 



1.0 lal lOX Ligation Buffer 

1.0 III lOmMrATP 

1 .0 lal 14 DNA Ligase (4Wu/^l) 



1.0 \il 
2.0 ^il 
6.0 lal 
1.0 [il 



lOX Ligation Buffer 



lOmMrATF 



PNK (from Stratagene's cDNA Synthesis Kit). 



2. 



4. 



3. 



Incubate 37°C, 30 minutes. 

Add 3 1 III HP and 5 [il lOX STE. 

Size fractionate on a Sephacryl S-500 spin column (pool fractuions 1-3). 



5. 



Phenol/chloroform extract once. 



6. 



Chloroform extract once. 



7. Add ice cold ethanol to precipitate. 

8. Place on ice, 10 minutes. 

9. Spin in microfuge, high speed, 30 minutes. 

10. Wash with 1 ml 70% ethanol. 

1 1 . Spin in micro ftige, high speed, 10 minutes and dry. 

12. Resuspend in 10.5 |il TE buffer. 



22 



Do not plate assay. Instead, ligate directly to arms as above escept use 2.5 of DNA and no 
water 

Package and titer as above. 

Efficiency results: 



ZAP ® (Stratagene): 9.6 x 10^ recombinants with 0% background 

Amplification of Libraries (5.0 x 10^ recombinants from each library) 

1 . Add 3.0 ml host cells (OD^eo = 1 .0) to two 50 ml conical tube. 

2. Inoculate with 2.5 X 10^ pfii per conical tube. 

3. Incubate 37°C, 20 minutes. 

4. Add top agar to each tube to a final volume of 45 ml. 

5. Plate the tube across five 150 mm plates. 

6. Incubate 37°C, 6-8 hours or until plaques are about pin-head in size. 

7. Overlay with 8-10 ml SM Buffer and place at 4°C overnight (with gentle rocking 
if possible). 

Harvest Phage 

1 . Recover phage suspension by pouring the SM buffer off each plate into a 50-ml 
conical tube. 

2. Add 3 ml chloroform, shake vigorously and incubate at room 
temperature, 15 minutes. 

3. Centriftige at 2K rpm, 10 minutes to remove cell debris. 

4. Pour supernatant into a sterile flask, add 500 pi chloroform. 

5. Store at 4°C. 

Titer Amplified Library 

1 . Make serial dilutions; 



gtll: 



2.5 X 10' 



recombinants with 2.5% background 



10-^ = 



1 (il amplified phage in 1 ml SM Buffer 
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10-^ = 1 ^il of the 10'^ dilution in 1 ml SM Buffer 

2. Add 200 ^1 host (in 10 mM MgS04) to two tubes 

3. Inoculate one with 10 |il lO"^ dilution (10'^). 

4. Inoculate the other with 1 |il 10"^ dilution (10"^). 

5. Incubate 37°C, 15 minutes. 

6. Add about 3 ml 48°C top agar. 

[50 ml stock containing 150 |il IPTG (0.5M) and 375 jil 
X-GAL (350 mg/ml)] 

7. Plate on 100 mm plates and incubate 37°C, overnight 

8. Results: 

gtl 1: IJxlO^Vml 
ZAP® (Stratagene): 2.0x1 0^**/ml 

Example 2 
Enzymatic Activity Assay 

The following is a representative example of a procedure for screening an expression 
library prepared in accordance with Example 1 . In the following, the chemical characteristic 
Tiers are as follows: 
Tier 1 : Hydrolase 

Tier 2: Amide, Ester and Acetal 

Tier 3: Divisions and subdivisions are based upon the differences between individual 

substrates which are covalently attached to the functionality of Tier 2 undergoing reaction; as 
well as substrate specificity. 

Tier 4: The two possible enantiomeric products which the protein, e.g. enzyme, may 

produce from a substrate. 



• 
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Although the following example is specitlcally directed to the above mentioned tiers, the general 
procedures for testing for various chemical characteristics is generally applicable to substrates 
other than those specifically referred to in this Example. 

Screening for Tier 1-hydrolase; Tier 2-aniide. Plates of the library prepared as described in 
Example 1 are used to multiply inoculate a single plate containing 200 |il of LB Amp/Meth, 
glycerol in each well. This step is performed using the High Density Replicating Tool (HDRT) 
of the Beckman Biomek with a 1% bleach, water, isopropanol, air-dry steriHzation cycle 
between each inoculation. The single plate is grown for 2h at 3 7*^0 and is then used to inoculate 
two white 96-well Dynatech microtiter daughter plates containing 250 [i\ of LB Amp/Meth, 
glycerol in each well. The original single plate is incubated at 37°C for 18h, then stored at 80°C. 
The two condensed daughter plates are incubated at 37°C also for 18 h. The condensed daughter 
plates are then heated at 70°C for 45 min. to kill the cells and inactivate the host E.coli proteins, 
e.g, enzymes. A stock solution of 5mg/mL morphourea phenylalanyl-7-amino-4-trifluoromethyl 
coumarin (MuPheAFC, the 'substrate') in DMSO is diluted to 600 |iM with 50 mM pH 7.5 
Hepes buffer containing 0.6 mg/mL of the detergent dodecyl maltoside. 



Fifty |iL of the 600 |iM MuPheAFC solution is added to each of the wells of the white 
condensed plates v^th one 100 [iL mix cycle using the Biomek to yield a final concentration of 
substrate of ^100 |aM. The fluorescence values are recorded (excitation = 400 nm, emission = 
505 nm) on a plate reading fluorometer immediately after addition of the substrate (t=0). The 
plate is incubated at 70°C for 100 min, then allowed to cool to ambient temperature for 15 
additional minutes. The fluorescence values are recorded again (t=100). The values at t=0 are 
subtracted from the values at t=100 to determine if an active clone is present. 




MuPheAFC 
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The data will indicate whether one of the clones in a particular well is hydrolyzing the 
substrate. In order to determine the individual clone which carries the activity, the source library 
plates are thawed and the individual clones are used to singly inoculate a new plate containing 
LB Amp/Meth, glycerol. As above, the plate is incubated at 37°C to grow the cells, heated at 
5 70°C to inactivate the host proteins, e.g. enzymes, and 50 ixL of 600 |iM MuPheAFC is added 
using the.Biomek. Additionally three other substrates are tested. They are methyl umbelliferone 
heptanoate. the CBZ-arginine rhodamine derivative, and fluorescein-conjugated casein (-3.2 
mol fluorescein per mol of casein). 




mmihyi umtxIUferooe bepCuMUr (CBZ^rfialiM), rhodamine 110 

^5:^ The umbelliferone and rhodamine are added as 600 |uM stock solutions in 50 jiL of Hepes 

buffer. The fluorescein conjugated casein is also added in 50 |iL at a stock concentration of 20 
Q and 200 mg/mL. After addition of the substrates the t=0 fluorescence values are recorded, the 
]r} plate is incubated at 70°C, and the t=100 min. values are recorded as above. 

20 These data indicate which plate the active clone is in, where the arginine rhodamine 

derivative is also turned over by this activity, but the lipase substrate, methyl umbelliferone 
heptanoate, and protein, fluorescein-conjugated casein, do not function as substrates, the Tier I 
classification is ^hydrolase' and the Tier 2 classification is amide bond. No cross reactivity should 
be seen with the Tier 2-ester classification. 

25 

As shown in Figure 1 , a recombinant clone from the library which has been characterized 
in Tier 1 as hydrolase and in Tier 2 as amide may then be tested in Tier 3 for various 
specificities. In Figure 1, the various classes of Tier 3 are followed by a parenthetical code 
which identifies the substrates of Table 1 which are used in identifying such specificities of Tier 

30 3. 
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As shown in Figures 2 and 3, a recombinant clone from the library which has been 
characterized in Tier 1 as hydrolase and in Tier 2 as ester may then be tested in Tier 3 for various 
specificities. In Figures 2 and 3, the various classes of Tier 3 are followed by a parenthetical 
code which identities the substrates of Tables 3 and 4 which are used in identifying such 
5 specificities of Tier 3. In Figures 2 and 3, R2 represents the alcohol portion of the ester and R' 
represents the acid portion of the ester. 

As shown in Figure 4, a recombinanl clone from the library which has been characterized 
in Tier 1 as hydrolase and in Tier 2 as acetal may then be tested in Tier 3 for various 
10 _ specificities. In Figure 3, the various classes of Tier 3 are followed by a parenthetical code which 
l E identifies the substrates of Table 5 which are used in identifying such specificities of Tier 3. 

^ f Proteins, e.g enzymes, may be classified in Tier 4 for the chirality of the product(s) 

ill produced by the enzyme. For example, chiral amino esters may be determined using at least the 
15 following substrates: 

CHj-COr C Ha H Vmi 

For each substrate which is turned over the enantioselectivity value, E, is determined according 
to the equation below: 

ln[(l-c(l+eep)] 
ln[(l-c(l-ecp)] 

where eCp = the enantiomeric excess (ee) of the hydrolyzed product and c = the percent 
conversion of the reaction. See Wong and Whitesides, Proteins, e.g. enzymes, in Synthetic 
30 Organic Chemistry, 1 994, Elsevier, Tarryto vm. New York, pp. 9- 1 2. 
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The enantiomeric excess is determined by either chiral high performance liquid chromatography 
(HPLC) or chiral capillary electrophoresis (CE). Assays are performed as follows: two hundred 
pL of the appropriate buffer is added to each well of a 96-well white microtiter plate, followed 
by 50 |.iL of partially or completely purified protein, e.g enzyme, solution; 50 ixL of substrate is 
added and the increase in fluorescence monitored versus time until 50% of the substrate is 
consumed or the reaction stops, whichever comes first. 

Example 3 

Construction of a Stable, Large Insert Picoplankton Genomic DNA Library 

Figure 5 shows an overview of the procedures used to construct an environmental library 
from a mixed picoplankton sample. A stable, large insert DNA library representing picoplankton 
genomic DNA was prepared as follows. 

Cell collection and preparation of DNA. Agarose plugs containing concentrated 
picoplankton cells were prepared from samples collected on an oceanographic cruise from 
Newport, Oregon to Honolulu, Hawaii. Seawater (30 liters) was collected in Niskin bottles, 
screened through 10 |im Nitex, and concentrated by hollow fiber filtration (Amicon DCIO) 
through 30,000 MW cutoff polyfulfone filters. The concentrated bacterioplankton cells were 
collected on a 0.22 Um, 47 mm Durapore filter, and resuspended in 1 ml of 2X STE buffer (IM 
NaCI, O.IM EDTA, 10 mM Tris, pH 8.0) to a final density of approximately 1x10*° cells per 
ml. The cell suspension was mixed with one volume of 1% molten Seaplaque LMP agarose 
(FMC) cooled to 40°C, and then immediately drawn into a 1 ml syringe. The syringe was sealed 
with parafilm and placed on ice for 10 min. The cell-containing agarose plug was extruded into 
10 ml of Lysis Buffer (lOmM Tris pH 8.0, 50 mM NaCI, O.IM EDTA, 1% Sarkosyl, 0.2% 
sodium deoxycholate, 1 mg/ml lysozyme) and incubated at 37°C for one hour. The agarose plug 
was then transferred to 40 mis of ESP Buffer (1% Sarkosyl, 1 mg/ml proteinase K, in 0.5M 
EDTA), and incubated at 55°C for 16 hours. The solution was decanted and replaced with fresh 
ESP Buffer, and incubated at 55°C for an additional hour. The agarose plugs were then placed in 
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50 mM EDTA and stored atM°C shipboard for the duration of the oceanographic cruise. 

One slice of an agarose plug (72 |il) prepared from a sample collected off the Oregon 
coast was dialyzed overnight at 4°C against I mL of buffer A (1 OOmM NaCI, lOmM Bis Tris 
Propane-HCl 100 |ig/ml acetylated BSA: pH 7.0 25°C) in a 2 mL microcentrifuge tube. The 
solution was replaced with 250 |.il of fresh buffer A containing 10 mM MgCU and 1 mM DTT 
and incubated on a rocking platform for 1 hr at room temperature. The solution was then 
changed to 250 \x\ of the same buffer containing 4U of Sau3Al (NEB), equilibrated to 37 °C in a 
water bath, and then incubated on a rocking platform in a 37°C incubator for 45 min. The plug 
was transferred to a 1.5 ml microcentrifuge tube and incubated at 68°C for 30 min to inactivate 
the protein, e.g. enzyme, and to melt the agarose. The agarose was digested and the DNA 
dephosphorylased using Gelase and HK-phosphatase (Epicentre), respectively, according to the 
manufacturer's recommendations. Protein was removed by gentle phenol/chloroform extraction 
and the DNA was ethanol precipitated, pelleted, and then washed with 70% ethanol. This 
partially digested DNA was resuspended in sterile H2O to a concentration of 2.5 ng/|il for 
ligation to the pFOSl vector. 

PGR amplification results from several of the agarose plugs (data not shown) indicated 
the presence of significant amounts of archaeal DNA. Quantitative hybridization experiments 
20 using rRNA extracted from one sample, collected at 200 m of depth off the Oregon Coast, 

indicated that planktonic archaea in (this assemblage comprised approximately 4.7% of the total 
picoplankton biomass (this sample corresponds to "PACr'-200 m in Table 1 of DeLong etal, 
high abundance of Archaea in Antarctic marine picoplankton, Nature^ 377:695-698, 1994). 
Results from archaeal-biased rDNA PGR amplification performed on agarose plug lysates 
25 confirmed the presence of relatively large amounts of archaeal DNA in this sample. Agarose 
plugs prepared from this picoplankton sample were chosen for subsequent fosmid library 
preparation. Each 1 ml agarose plug from this site contained approximately 7.5 x 10^ cells, 
therefore approximately 5.4 x 10^ cells were present in the 72 \i\ slice used in the preparation of 
the partially digested DNA. 

30 
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Vector amis were prepared from pFOSI as described (Kim et ai. Stable propagation of 
casmid sized human DNA inserts in an F factor based vector, Nucl. Acids Res., 20:10832-10835, 
1992). Briefly, the plasmid was completely digested with Astll, dephosphorylated with HK 
phosphatase, and then digested with BamHI to generate two anns, each of which contained a cos 
site in the proper orientation for cloning and packaging ligated DNA between 35-45 kbp. The 
partially digested picoplankton DNA, isolated by partial fragment gel electrophoresis (PFGE), 
was ligated overnight to the PFOSI arms in a 15 |al ligation reaction containing 25 ng each of 
vector and insert and lU of T4 DNA ligase (Boehringer-Mannheim). The ligated DNA in four 
' V I ' microliters of this reaction was in vitro packaged using the Gigapack XL packaging system 

Vstratagene), the fb-smid particles trmi^ecjed to E. coli strain DHIOB (BRL), and the cells spread 
^ /onto LBc^i5 plates. The resultant fosmi4 clones were picked into 96-well microliter dishes 

containing LB^^jg supplemented with 7% glycerol. Recombinant fbsmtds, each containing cat 40 
kb of picoplankton DNA insert, yielded a library of 3,552 fcsmid clones, containing 
approximately 1.4 x 10^ base pairs of cloned DNA. All of the clones examined contained inserts 
ranging from 38 to 42 kbp. This library was stored frozen at -80°C for later analysis. 

Numerous modifications and variations of the present invention are possible in light of 
the above teachings; therefore, within the scope of the claims, the invention may be practiced 
other than as particularly described. 
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Table 1 



A2 



Fliit'fctccin coniugaied casein (3.2 mol nii«»fc$Qctn/rrn»l cwcint 

CBZAXt'AMC 

tBOC-Ai«-AU-Acp.AMC 

succinyl-AJa Gly l.cu-AMC 

CBZ-Arg-AMC 

CBZ-Mc«-AMC 

oioqshoiffec-PheAMC 

l-BOC « t-buioxy cvbonyt CB2 « c«rixHiyl benxyloxy. 
AMC = 7-«niino-4 -methyl coufnarin 



AC3 

o 
o 






AD3 



Rucrcscetn conjugtted casein 



c BOC- Ala-AU-Aip-Af C 
CBZ- AU-AU-Lyi-AFC 



lUGcinyl-AU-AU-Phe-AFC 
iuccinylAla-Gly-Leu-AFC 



AFC s T-cmino-^-frinuoromeUiyl cnumvtn.) 




«ucdnyUAi«-AU.f*he-AFC 
CBZ ri«-AFC 
CBZ-Trp-AFC 



AH3 



AF3 

l-BOC- AU-AU-Axp.AFC 
CBZA^pAFC 



AM 



AG3 

CBZ AU-AJ».Lyf AFC 
CB7. Ax^-AFC 



succinyl-Alj-CIyLeuAR' 

cbzauak: 

C07 Sewr-Aft 
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Table 3 



LI 13 ^ ^ Ll.l 



And all ofL2 



U3 



LK3 LL3 

o 



LN3 

L03 



|C07-CHjPft H 
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Table 4 




4-mcthyl umbelliferone 
wherein R = 



G2 ^-D-galaciosc 

^-D-glucose 

^-D-glucuronidc 
GB3 ^-D-ccllotrioside 

^-B-cellobiopyranoside 
GC3 ;3-D-galacrosc 

a-D-galactosc 
GD3 /3-D-giucasc 

a-D-glucosc 
GE3 ^-D-glucuronidc 
GI3 ^-D-N,N-diacctylchitobiose 
GJ3 ^-D-fucose 

a-L-fucosc 

iS-L-fucosc 
GK3 ^e-D-mannosc 

Qf-D-mannose 



non-UmbcIIifcryl substrates 

GA3 amylose [polyglucan al,4 linkages], amylopcctin 

(polyglucan branching a 1,6 linkages] 
GF3 xylan [poly 1,4-D-xylan] 

GG3 amylopcctin, pullulan 

GH3 sucrose, fruciofuranoside 
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